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ABSTRACT 

Radio jet and core data for a complete sample of 98 FRII sources with z < 1 are analysed 
with a Markov-Chain Monte Carlo (MCMC) model fitting method to obtain constraints on 
bulk-flow speeds in the beam. The Bayesian parameter-inference method is described and 
demonstrated to be capable of providing meaningful constraints on the Lorentz factor at both 
kiloparsec and parsec scales. For both jets and cores we show that models in which some 
intrinsic dispersion is present in the features' intrinsic prominence, bulk-flow speeds or both 
provide the best fit to the data. The constraints on the Lorentz factor on parsec scales are found 
to be consistent with the expected values given VLBI observations and other evidence, with 
7 « 10-14. On kiloparsec scales, the Lorentz factor is found to be 1.18 -1.49, in agreement 
with the results of previous analyses of radio jet data. These values are clearly not consistent 
with the 7^10 speeds required by beamed inverse-Compton models of X-ray emission from 
quasar jets; our results therefore support models that require velocity structure in powerful 
jets. 
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1 INTRODUCTION 

1.1 Relativistic beaming of jet emission 

Fanaroff & Riley (1974) type II quasars and radio galaxies (here- 
after FRIIs) generally exhibit double radio lobes with scales of tens 
to hundreds of kiloparsecs that are symmetrical about the source's 
central engine. There is now a very large amount of evidence for 
models (e.g. Scheuer 1974; Blandford & Rees 1974) in which these 
are supplied with energy, mass, momentum and magnetic flux by a 
bipolar, symmetrical, continuous flow of material - the 'beam' or 
'beams'. These outflows must persist and be well-collimated out to 
the 100-kpc scales of the lobes in order to give rise to the observed 
compact terminal hotspots. However, the observational signatures 
of these collimated outflows, 'jets' (e.g. Bridle & Perley 1984), are 
not always detected in FRIIs, either in the radio or at other wave- 
bands. Where jet emission is observed, the jet is very often 'one- 
sided': i.e., it is either detected on one side of the source only or 
is very much brighter on one side. This is particularly character- 
istic of powerful FRII quasars. As the presence of twin beams is 
suggested by the morphology of the large scale structure, the fact 
that the detection of both jets in these powerful sources is so un- 
common supports the hypothesis that the beam's emitting material 
is moving at relativistic speeds on kpc scales and that the emis- 
sion is affected by Doppler boosting ('beamed'). For the case of 
a source where the beam axis makes a relatively small angle to 
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the line of sight, as would be the case in standard unified models 
(Barthel 1989) for an FRII quasar, emission associated with the ap- 
proaching jet is then Doppler-boosted and observable, while that 
associated with the receding jet is Doppler-suppressed and not de- 
tected. For a source where the beam axis lies close to the plane of 
the sky, both jets are likely to be Doppler-suppressed. 

The key piece of evidence for relativistic beaming on kpc 
scales comes from the tendency for the jet side to be associated 
with the less depolarised lobe of the source, the Laing-Garrington 
effect (Garrington et al. 1988, Laing 1988). The depolarizing mech- 
anism is believed to be an external Faraday screen in which the 
source is embedded, presumably the hot phase of the intergalactic 
medium, and thus the degree of depolarization observed depends 
upon the path length of the radiation through the screen. The less 
depolarized lobe is therefore expected to be the nearer lobe, or the 
lobe pointing towards us, and any correlation with the kiloparsec jet 
requires that the jet emission is Doppler-boosted. However, while 
observations of the Laing-Garrington effect require relativistic jet 
speeds on kiloparsec scales, they do not tell us what these speeds 
are. 

Direct evidence for relativistic flow speeds in the inner regions 
of the beams, on parsec and sub-parsec scales, comes from VLBI 
observations of apparent superluminal motion. For example. Hough 
et al. (2002) have mapped the parsec scale regions of a complete 
sample of 25 lobe-dominated quasars (defined as having a ratio of 
nuclear to extended flux density at 5 GHz of less than 1) from the 
sample of Laing, Riley & Longair (1983, hereafter LRL). Of these 
sources, all have resolved structure on parsec scales, with one-sided 
jets detected on the same side as that of the kiloparsec jet where 
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such a jet is detected. Hough et al. estimate bulk flow speeds with 
Lorentz factor, 7, ~ 5 — 10 from multi-epoch observations, and 
these results are consistent with those from other observing pro- 
grammes. Further observational support for the idea of high bulk 
speeds in radio-loud AGN in general comes from the rapid vari- 
ability and consequent high brightness temperatures of parsec-scale 
features, the absence of very strong inverse-Compton emission in 
X-ray observations of the nucleus, and transparency to high-energy 
7 rays, which together imply bulk Lorentz factors 7 > 2 and possi- 
bly as high as ~ 50 (Begelman, Fabian & Rees 2008). Arguments 
based on unification and population statistics (mostly in low-power 
objects; e.g. Chiaberge et al. 2000, Hardcastle et al. 2003) imply 
7 3: a plausible explanation for the widely differing 7 values 
is that there is velocity structure in the parsec-scale jet. However, 
the essential point for our purposes is that all parsec-scale esti- 
mates agree on the need for Lorentz factors corresponding to speeds 
greater than ~ 0.9c, and the direct VLBI estimates imply speeds 
> 0.99c in general. It is therefore important to ask whether these 
speeds persist to the kiloparsec scale. 

Two approaches to determining the kiloparsec-scale jet speed 
have been taken in the literature. The first uses the radio proper- 
ties of the jets. This approach was pioneered by Wardle & Aaron 
(1997, hereafter WA97), who analysed the observed jet flux asym- 
metry in the 13 quasars imaged by Bridle et al. (1994). The jet flux 
asymmetry was defined as the observed jet flux over the counter- 
jet flux (where the counter-jet is the fainter of the two), but as for 
most sources no counter-jet was actually detected, many of their 
data points are actually lower limits. Taking into account the pos- 
sibility of some intrinsic asymmetry, they simulated a number of 
data sets that were compared to the observed data by means of 
a Kolmogorov-Smimov (K-S) test. They found that the observed 
data are best fitted with 0.6 < P < 0.8, where /3 is the jet speed 
as a fraction of the speed of light. However, because their parent 
sample was not complete, they were forced to take quite a complex 
approach to the inclusion of the selection criteria in their analysis, 
and effectively to treat the upper limit on angle to the line of sight 
made by the beam axes of their sources as a free parameter in their 
fits. Hardcastle et al. (1999, hereafter H99), used a similar method 
to constrain the jet bulk-flow velocities for their sample of FRIls 
with z < 0.3, which overlaps considerably with the sample con- 
sidered in the present paper (see Section l2Tt . Rather than using 
jet sidednesses they used the jet and core prominences (defined as 
the ratio between the jet/core flux density and that of the extended 
emission: see Section l2!4t . Because their sample was complete the 
source orientation could be assumed to be random, simplifying the 
analysis with respect to WA97's work. In their analysis the free pa- 
rameters were the intrinsic prominence, pint, and j3 and, exploring 
a grid in these two parameters and using K-S tests in the same way, 
they derived speeds between 0.5c ~ 0.7c. Arshakian & Longair 
(2004) used an analytic approach to the H99 data to infer slightly 
lower speeds, /3 ~ 0.4, on the basis of the jet sidedness distribu- 
tion, while constraining /3 > 0.6 for the sample used by WA97. 
Thus all the approaches based on the distribution of the observed 
properties of the radio jets to date have been consistent in implying 
only moderately relativistic bulk speeds, /3 ~ 0.5 ± 0.1. 

However, a different approach is motivated by the widespread 
detection of strong X-ray emission from the jets of core-dominated 
quasars, believed to be the highly aligned counterparts of the FRII 
radio galaxies and lobe-dominated quasars studied by WA97 and 
H99. Following the discovery of the prototype of this class, PKS 
0637—752 (Schwartz et al. 2000), it was quickly realised that the 
broad-band spectra of these object preclude a one-zone synchrotron 



model for the radio through X-ray data, while various inverse- 
Compton models for the X-rays require extreme departures from 
equipartition for a non- or mildly relativistic jet. Instead, the model 
proposed independently by Tavecchio et al. (2000) and Celotti, 
Ghisellini, & Chiaberge (2001) is widely adopted. In this model, 
the jet is moving relativistically, with a bulk Lorentz factor 7^1. 
As seen by the jet, the energy density in the microwave background 
increases by a factor of the order 7^, and this increases the emissiv- 
ity of the inverse-Compton scattering of the microwave background 
(hereafter CMB/IC) in the jet frame; in PKS 0637-752, crucially, 
the 7 (~ 10) required for the kpc-scale jet is very similar to that 
inferred from VLBI studies of superluminal motion in the nucleus. 
The emission from this process is strongly anisotropic and so is 
only visible in core-dominated objects, but, in unified models, the 
jets in lobe-dominated quasars and FRII radio galaxies must have 
comparable speeds. The implication of 7 ^ 1 in the kpc-scale jets 
of all radio-loud objects is in strong contrast to the results of the 
prominence/sidedness analyses described above. 

How can these two very different estimates of the kpc-scale 
jet speed be reconciled? There is still disagreement in the litera- 
ture over whether the beamed CMB/IC model really does describe 
all, or even any, of the observed quasar jets (e.g. Stawarz et al. 
2004, Hardcastle 2006, Jester et al. 2007). If it does, then, as ar- 
gued by Hardcastle (2006), velocity structure in the kiloparsec- 
scale jets and perhaps bulk deceleration on hundred-kpc scales 
seem inevitable consequences. At the same time, though, the exist- 
ing work on the radio data is open to a number of criticisms. The K- 
S test is not really adapted to model fitting (that is, it is not obvious 
that maximizing the K-S test null hypothesis probability really cor- 
responds to maximizing the likelihood). More seriously, the large 
number of jet or counterjet non-detections and consequent limits 
on sidedness or prominence measurements are hard to take into ac- 
count either in the K-S method of WA97 or H99 or in the analytical 
method of Arshakian & Longair (2004), although H99 attempted 
to assess the effect of the limits in their sample by scaling them and 
argued that they did not have a strong effect. In all cases the sample 
sizes are small; additionally, WA97's sample has complex selection 
effects while H99's sample is low-luminosity and contaminated by 
low-excitation radio galaxies whose role in unified models is not 
clear. For all these reasons, it is worth revisiting the radio-based 
estimates of kpc-scale jet speeds with new data and new analysis 
techniques: the present paper presents the results of such a study. 

1.2 This paper 

The present paper is based on the work of Mullin, Riley & Hard- 
castle (2008, hereafter Paper I). In that paper we presented a de- 
tailed study of the observed properties of a complete sample of 
FRII sources, including the kiloparsec-scale jets and core features. 
We concluded that the observational evidence supports the beaming 
hypothesis and that, while there is stronger evidence for relativistic 
speeds on parsec scales, the observed correlation between jet and 
core brightness implies the extension of high bulk-flow speeds into 
kiloparsec scales. 

In this paper we use these jet and core data together with a 
Bayesian inference method in order to constrain Lorentz factors 
in the beams on parsec and kpc scales. Our approach is free from 
many of the disadvantages of earlier work. We do not carry out 
systematic grid searches of parameter space, and so are not limited 
to a small number of model parameters: this allows us to deal with 
the case in which the intrinsic prominence and speed distributions 
are not delta functions but themselves have some intrinsic scatter. 
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Crucially, we can also treat the limits in the data properly, rather 
than treating them as measurements. In addition, our dataset is a 
factor ^ 2 larger than that of H99, and contains luminous quasars 
and powerful radio galaxies which are well matched to the core- 
dominated quasars for which high bulk Lorentz factors have been 
inferred. This paper therefore represents a significant improvement 
over previous work. 

The remainder of the paper is structured as follows. In the fol- 
lowing Section the dataset is described, while the analysis method 
and performance of the code used is discussed in Section[3] Results 
are presented in Section |4] and the discussion and conclusions are 
in Section[5] 

Throughout the paper we use the quantities measured and 
calculated in Paper I: this implies the use of a cosmology with 
//o = 70 km s"^ Mpc~\ Q.^ = 0.3 and Q.k = 0.7. All sym- 
bols used are summarized in Table[T] 



2 THE DATA 
2.1 The sample 

The sample is that of Paper I, which consists of the 98 FRII radio 
galaxies and quasars with 2 < 1 in the sample of LRL. This is a 
complete flux-limited sample, including all 3CR sources observed 
to have Sns > 10.9 Jy (on the scale of Baars et al., 1977) with 
declination > 10° and |&| > 10°, where Sm is the total source 
flux measured at 178 MHz. At this low frequency the source flux is 
dominated by the diffuse emission of the large scale lobe structure 
and as such little contribution should be made by Doppler-boosted 
components; thus the selection criterion should ensure that the sam- 
ple is not biased with respect to orientation. 

A long-term observing project has mapped the vast majority 
of these 98 sources at high resolution and sensitivity with the VLA 
telescope. Data from this project have been presented in a series of 
papers: Black et al. (1992), Leahy et al. (1997), Hardcastle et al. 
(1997), Gilbert et al. (2004) and, most recently, Mullin, Hardcastle 
& Riley (2006). Observations of the outstanding sources have been 
made by us or by other workers and the data are available in the 
literature; all references to the data used here are given in Paper 
I. In addition, the data - maps and measurements - from Paper I 
are now available on- linfl The subsample of sources with z < 
0.3 overlaps considerably with that of H99, and the measurements 
of H99 are used for the sources that we have in common, but the 
larger redshift range of our sample gives us a factor 2 more sources 
(improving the statistical significance of our results) and means that 
15 FRII quasars are included, complementing the data on broad- 
and narrow-line radio galaxies from H99's work. 



2.2 Jets 

The definition of the term jet and a discussion of the data and mea- 
surements is given in Paper I. Here, the jet criteria are reiterated: 
they are based on those of Bridle & Perley (1984). Thus, a jet is 
any feature that is 

(i) at least four times as long as it is wide; 

(ii) separable at high resolution from other extended structures 
(if any), either by brightness contrast or spatially (e.g. it should be 
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a narrow ridge running through more diffuse emission, or a nar- 
row feature in the inner part of the source entering more extended 
emission in the outer part). 

In some sources jets appear to bend, and this causes problems for 
analysis in terms of beaming models, which must assume a single 
angle to the line of sight 6, as discussed by Bridle et al. (1994). 
We follow Bridle et al. (1994) and H99 in defining the straight jet, 
which satisfies the above two criteria but also must be aligned with 
the compact radio core where it is closest to it (and is measured 
from the end closest the core along its length only while the devi- 
ation from a straight line is less than the jet radius). Only the flux 
density of the straight jet is used in the analysis in this paper. 

In practice, the straight jet is taken to be the longest straight 
section of the jet in the source that is aligned with the core. Us- 
ing the AIPS task TVSTAT, the integrated flux within the region 
containing the apparent jet emission. Fobs, was found. Background 
flux was corrected for by taking measurements of two regions iden- 
tical in size to the initial jet measurement to the immediate right 
and left of the feature. The average of these, Bobs, was then sub- 
tracted from the jet measurement to give the observed jet flux, 
Jobs ~ Fobs — Bobs. In order to get the best estimate of Jobs, three 
values of jet flux were taken this way and averaged. The great- 
est source of error in the jet measurement is considered to arise 
from the ambiguity in defining the jet emission itself: the errors 
quoted are therefore based on the range of the three jet measure- 
ments made. Where no jet emission is detected, an upper limit is 
estimated by measuring the integrated flux of a region ~ 2 restor- 
ing beam widths across the entire distance between the core and pri- 
mary hotspot region. Background flux is corrected for in the same 
manner as for the detected jets by taking two further integrated flux 
measurements either side of the initial region. However, if the flux 
associated with the central region is not the highest of the three, 
then the upper limit estimate is taken to be the positive difference 
between the central measure and the lower of the other two. 

The sample extends over a large range in redshift and was ob- 
served using a variety of different telescopes and telescope config- 
urations, which means that the effective (spatial) observing reso- 
lution is far from constant across the sample. Observational effects 
on jet detectability were considered in Paper I, where we concluded 
that, although observing resolution is clearly a factor in jet visibil- 
ity, there is no simple systematic bias across the sample nor any 
trend with redshift. We therefore do not expect that the variations 
in effective observing resolution will affect the robustness of the 
results of any analysis of Pobsj . 

2.3 Cores 

Paper I also contains a detailed discussion of core measurements. 
The core measurements were obtained from the highest-resolution 
map available for a given source using the AIPS task IMFIT, which 
fits an elliptical Gaussian model of between one and four compo- 
nents to a feature. One component was fitted and the peak inten- 
sity found was taken as the core flux. As most cores in the sam- 
ple were unresolved at all resolutions such a model fitted the data 
well. Errors were determined from the square root of the average of 
the squared formal errors returned from the fitting procedure. For 
around two thirds of the sample this error is less than 2 per cent of 
the core flux, so the calibration error (expected to be 2-3 per cent) 
will dominate. Errors quoted therefore correspond to 3 per cent of 
the core flux measurement, unless the formal error from JMFIT is 
greater, in which case the latter is quoted. Cores were undetected 



4 L. M. Mullin & M. J. Hardcastle 



in only seven sources, and in these cases estimates of upper limits 
were made from the off-source noise. 

Again, observing effects were considered for the core promi- 
nence data in Paper I. We concluded that, as the cores are typically 
bright, unresolved features that are generally much brighter than 
any lobe or jet material with which they might be confused at low 
resolution, observational effects are not a source of systematic bias 
in the analysis of Pobsc ■ 

2.4 Prominence 

In order to analyse the jet and core data the observed jet and core 
flux density data are extrapolated to a common frequency of 8.4 
GHz and then K-corrected assuming the spectral index, a,= 0.5 
for jets and = for cores, before being converted to a luminosity 
using the relation 

P ^ R'{1 + zfS (1) 

where P is the jet or core luminosity, R the proper distance (as cal- 
culated using the ANGSI^code) and S the jet or core flux density. 
The total source flux measured at 178 MHz is K-corrected using the 
low-frequency spectral index, aij (appropriate over the range 178 
- 750 MHz), and converted to a luminosity, Pirg. The jet and core 
prominence are then the ratio of the corresponding feature's lumi- 
nosity to Pi 78. Observed jet and core prominences for our sample 
are plotted in Figs[T]and|2] 

This definition of prominence, which we used in Paper I, as- 
sumes that the total source flux as measured at 178 MHz is uncon- 
taminated by beamed components. It is expected that the sample 
will be dominated by the extended lobe emission at this low fre- 
quency; however, in sources with very bright jet or core features 
it is not clear that no contamination exists and this is potentially a 
source of bias in the prominence values. We investigated whether 
there is any evidence for such contamination in our data by com- 
paring prominence values evaluated as described above with those 
evaluated with a modified total source flux, that is, a total source 
flux corrected by subtracting off jet and core features. 

The modified total source flux at 178 MHz was calculated by 
extrapolating the jet and core fluxes from the observed frequency 
to 178 MHz. Here, the total jet was used, that is, the feature that 
satisfies the jet conditions as specified in Section [Z2l and not the 
more restrictive straight jet conditions (see Paper I for a more de- 
tailed description of total and straight jet definitions). Both jet and 
counter-jet features were extrapolated back to 178 MHz. The cor- 
rected fluxes were then subtracted from the total source flux, and 
this modified total source flux was K-corrected and used to evalu- 
ate the core and straight jet prominence as described above. 

Plotting these alternative jet and core prominence values 
against those initially determined, there was no evidence that either 
the core or jet prominence values were greatly affected by contam- 
ination of beamed components if the total source flux at 178 MHz 
is used without first correcting it - the correlation in the plots is 
linear and there is no trend for sources to curve away from this 
line. Additionally, considering the errors in the jet and core flux 
measurements, the difference in prominence values made by modi- 
fication of the total source flux is low - in particular, the difference 
in the jet prominence made by modifying the total source flux is 
typically a fraction of a percent of the quoted error. 

From this we conclude that using the 178 MHz total source 
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flux without making any attempt to remove beamed emission to 
define prominence, as we did in paper I, will not affect the robust- 
ness of our analysis. We therefore use the unmodified prominence 
for consistency with Paper I. 

Finally, we note that in Paper I a spectral index of 0.5 was used 
for jets, despite the fact that higher values of ~ 0.8 have typically 
been used by other workers (H99, for example). Hotspot features 
are expected to be associated with much flatter spectra than jets 
and it was decided in the previous analysis, given that trends in 
core, jet and hotspot properties as well as correlations between the 
features were being considered, that a spectral index of 0.5 should 
be used for both jet and hotspot features. The analysis presented 
here uses the jet prominence data of Paper 1 ; however, we consider 
the effects of varying a for jets in Section l4~T] 



3 DATA MODELS 

3.1 Doppler boosting of source emission 

The emission from any component of a radio source that is travel- 
ling at a significant fraction of the speed of light with respect to an 
Earth-bound observer will be anisotropic due to relativistic beam- 
ing even if it is isotropic in the rest frame (in this paper we neglect 
the minor effects due to intrinsic anisotropy of emission in jets: see 
H99 for more discussion). As received on Earth, a feature will have 
an observed flux density, S^^^^ , given by 

= S._,(7[l - /3cose])-'"+'^ (2) 

(e.g. Ryle & Longair 1967; Bridle et al. 1994) where S^^^^^ is the 
flux density of the feature in the emitter's rest frame, (3 is the frac- 
tion of the speed of light at which the emitter is traveling, 7 is the 
Lorentz factor (=1/ \/{}-~]P)), 9 is the angle of the velocity 
vector to the line-of-sight, a is the spectral index of the radiation 
(where S oc and m is a constant reflecting the geometry 

of the beamed component. Following Scheuer & Readhead (1979) 
and Lind & Blandford (1985), the value of m taken here to be ap- 
propriate for a continuous jet is 2. As the spectral index appears in 
the exponent of the Doppler factor, it follows that, all other things 
being equal, a larger assumed a gives rise to a stronger beaming 
effect. 



3.2 Jets 

Applying equation ^ to the jet features in the sample, we obtain 
the relationship between the observed jet prominence, pobsj , and 
the intrinsic jet prominence, pintj : 

fobsj =Pmtj[7(l -/JcOSe)]"^"''" (3) 

where a is the adopted spectral index of the jet, pint is the intrinsic, 
rest-frame prominence of the jet feature and is the orientation 
of the jet with respect to the observer's line of sight (7, /3 are as 
before). Here we assume that the normalizing luminosity (Pits in 
our case) is unaffected by beaming, as discussed above. 



3.3 Cores 

The core emission is thought to originate in the inner parsecs of 
the beam, that is, the parsec-scale bipolar jets, and so the model of 
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equation l|2j can be applied to the observed core prominence in the 
following manner, assuming that for cores a = 0: 

Pobsc = Pintj7a(l - /3aCOs6la)]~^ 

+Pintj7r(l-/3rCOSe»0]"^ (4) 

where the subscripts 'a' and 'r' correspond to the approaching and 
receding parsec scale jet respectively. Again the subscript 'int' indi- 
cates the intrinsic, rest frame prominence of these jets and 9, 7 and 
P are as before. In the simplest models, which we adopt throughout 
the paper, pint, = Pint, = 0.5 x pint,, 5, = 180° - = 180 - 61 
where d has the same value as for the kpc-scale jet, — Pa. — Pc 
and 7r = 7a = 7c- 

3.4 Model Fitting Metliod 

3.4.1 Beaming Model Validity 

As discussed in Section 11.11 previous workers have used jet and 
core models of the form given in equations ^ and l|4} to constrain 
7 values. An advantage of the present analysis method over these 
studies is the possibility to allow for distributions in the intrinsic 
prominence and 7 parameters. Arguments have been made (e.g. 
Urry & Shafer 1984) that the intrinsic jet or core prominence might 
be expected to be a fixed fraction of the intrinsic total source flux, 
but, while this is a useful simplifying assumption, it is more likely 
that there will be a range in intrinsic prominence values in the sam- 
ple, since if nothing else there will be scatter in the relationship be- 
tween the total (normalizing) luminosity and the intrinsic jet power. 

In the present paper the nature of such distributions is assumed 
to be normal or log-normal in the case of jets, since such distribu- 
tions are appropriate to cases in which the observed values are the 
sum, or product, of many variable factors. This is also the case for 
the cores, but in this case we also consider a model based on power- 
law distributions. There have been a number of studies in the liter- 
ature of samples of VLBI observations in terms of the apparent 
velocities and luminosities, often including model fitting to obtain 
information on the probable 7s and luminosity functions. In these 
studies, a power-law distribution for 7 is often assumed. For exam- 
ple, Urry & Padovani (1990) consider the effects of allowing such 
a distribution in the bulk Lorentz factors in their analysis, arguing 
that it allows both low or high values of 7 to be favoured, but also 
that a wide Gaussian distribution would resemble a flat power-law. 
Subsequent studies, such as Lister & Marscher (1997), and more 
recently Cohen et al. (2007), find that model fitting based on such 
distributions is consistent with the data, though no evidence that 
such a model is to be favoured is reported. Given this other work, 
applying a power-law distribution to the core data provides an in- 
teresting comparison to models based on a normal distribution. A 
uniform distribution in cos 6 is used for 9, assuming that the sample 
sources are randomly aligned with respect to the observer's line of 
sight. 

Examples of simulated data sets using this approach and the 
beaming models of equations Jsj ^rid Q are plotted together with 
the observed data in Figs [T] and |2l It can be seen that the shape of 
the resulting distributions can give a good representation of the true 
observed distributions. (Note that the simulated data sets shown are 
not fitted to the data in any way other than by simple scaling, though 
the parameters used are representative of those we obtain in subse- 
quent sections by fitting.) The observed jet and core prominence is 
strongly determined by 7 in each case. The effect of beaming on 
the shape of the observed prominence with respect to the normally 
distributed intrinsic prominence is to suppress the observed emis- 















\ 


\ 
















1 
























1 


i 


i 


4/' 






mi . .]J. 


/ 

















10 ^ 10 * 10 0.01 

Jet prominence 



Figure 1. Histogram of the observed jet prominence data, Pobsj ■ Filled re- 
gions indicate measurements and empty regions upper limits. The data are 
compared (solid lines) with the expected distribution jet prominence from 
a simulation with 7 = 1.5 (broad curve, red) and the corresponding distii- 
bution of intrinsic prominence (narrower curve, green), renormalized to a 
common maximum value for convenience of presentation. Pintj has a log- 
normal distribution with a = 0.3 (in units of the natural logarithm) around 
In(pintj) = -8. 
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Figure 2. Histogram of the observed core prominence data, Pobsc ■ Regions 
and curves are as in Fig. [T] Here the simulated data have 7 = 10.0 and 
Pintc has a lognormal distribution with cr = 0.8 (in units of the natural 
logarithm) around In(piiitj) = —5.5. 



sion in many sources, while boosting it in a smaller fraction. As 
7 increases this effect is more pronounced. Changes in the mean 
of the intrinsic prominence distribution affect the observed promi- 
nence distribution's location on the s-axis but not its shape: it acts 
as a scale factor. Changes in the intrinsic scatter that we assume 
broaden the distribution (and are thus to some extent degenerate 
with changes in 7) but also tend to smooth out the resulting distri- 
bution. 

Therefore, while the problem is degenerate, with at least three 
unknown parameters (intrinsic prominence, orientation and 7), it is 
clear that analysis of the observed prominence distribution can tell 
us something about beaming in the sample. 
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Table 1. Glossary for symbols used. 



Symbol Parameter Definition 

~S speed as a fraction of the speed of light Section ll.il 

Pint intrinsic prominence of the jet or core feature Section fTTT] 



Sl78 


total source flux measured at 178 MHz 


SectionlzT] 


-^obs 


measured jet flux 


SectionlzH 


Sobs 


jet background flux correction 


Sectioning] 


-^obs 


background-corrected jet flux 


SectionlTll 


a 


spectral index 


Section[Z4l 


P 


jet or core observed luminosity 


Section[Z4l 


R 


proper distance 


Section[24l 


S 


jet or core flux density 


Section[2!4l 


a\i 


low frequency spectral index 


Section[Z4l 


Pns 


source luminosity, as measured at 178 MHz 


Section[2!4l 




observed flux density 


Sectionim 


•S'l'reat 


flux density in emitter's rest frame 


Sectionim 


7 


bulk Lorentz factor 


SectionIXT] 


m 


constant reflecting the geometry of the beamed component 


Sectionim 


Pobaj 


observed jet prominence 


Sectionim 


Pintj 


intrinsic jet prominence 


Section[T2l 


e 


angle of emitter's trajectory with respect to observer's line of sight 


Section[T2l 


PobSc 


observed core prominence 


Section[331 


Pintc 


intrinsic core prominence 


Section[331 


Pinta 


intrinsic prominence of approaching parsec scale jet 


Section[331 


Pint, 


intrinsic prominence of receding parsec scale jet 


Section[331 


7a 


bulk Lorentz factor of approaching parsec scale jet 


Section[331 


7r 


bulk Lorentz factor of receding parsec scale jet 


Section[331 




approaching parsec scale jet speed as fraction of speed of fight 


Section[331 


Pr 


receding parsec scale jet speed as fraction of speed of figfit 


Section[331 


0a 


angle of approacfiing emitter's trajectory with respect to observer's line of sight 


Section[331 


0r 


angle of receding emitter's trajectory with respect to observer's line of sight 


Section[33l 



H 
D 
I 

P{H\D,I) 
PiH\I) 
P(D\H,I) 
P{D\I) 

Pmod 

P{PTaod^_i_l Ipmodt) 
Pcan 

Q(Pcan|Pmodt) 
0!(Pmodt,Pcan) 

r 



7 

7min 
7m ax 
Pint 

a 



X 
M 
Oik 



hypothesis 

observed data 

prior information about D 

posterior probability of H given D and / 

prior probabifiy of H 

likelihood of D given H and / 

evidence 

expected observed prominence corresponding to model parameter values 

transition probability 

candidate value for next chain step 

proposal distribution 

acceptance probability 

metropolis ratio 

factor by which the posterior is scaled in bum-in 

mean bulk Lorentz factor 

upper limit on 7 

upper limit on 7 

mean intrinisic prominence 

intrinsic dispersion in 7/7 

intrinsic dispersion in prominence 

power-law index for power-law Lorentz factor distribution 

general symbol for parameters 
model 

Bayes factor 



Section 
Section 
Section 
Section 
Section 
Section 
Section 
Section 
Section 
Section 
Section 
Section 
Section 
Section 

Section 
Section 
Section 
Section 
Section 
Section 
Section 

Section 
Section 
Section 



3A21 



3A2I 



3A2I 



3A21 



3A2I 



3A2I 



3A21 



3A2I 



3A2I 



3A21 



3A4I 



3.4.41 



JAA] 



3A4I 



3A4I 



3.4.41 



3A4^ 



3A6I 



3A61 



7c 



mean bulk Lorentz factor for the jets 
mean bulk Lorentz factor for the cores 



Section|431 
Sectionl431 
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3. 4. 2 MCMC approach 

The discussion in the following sections is only a very brief in- 
troduction to the MCMC approach to Bayesian inference; more de- 
tailed treatments can be found in the literature [e.g., Gregory (2005) 
and references therein]. 

We can obtain the posterior probability of a hypothesis H 
from Bayes' theorem: 



P{H\D,I) = P{H\I) 



P{D\H,I) 
P{D\I) 



(5) 



where D is the observed data and / is the prior information we have 
about D, P{H\I) is the prior probability of H, P{D\H,I) is its 
likelihood and P{D\I) is the normalization factor, the 'evidence'. 
In the present problem, we have observed prominence data, Pobs. 
If we define a parameter space in terms of pint, 7 and 9, we can 
evaluate the expected observed prominence, Pmodi, correspond- 
ing to a parameter set drawn from this space using equation ([3) 
or l|4j as appropriate. We can then determine the posterior prob- 
ability of Pmodi, f (PmodJPobs,/), by evaluating P(pmodJJ), 

P{Poh!i\Praodi,I) and P(Pobs|/)- 

The obvious problem is that, with a minimum of three pa- 
rameters, the exhaustive computation of the probability distribu- 
tion corresponding to the defined parameter space is not feasible. 
Straightforward Monte Carlo could provide a good approximation 
- the method is to draw uniform, randomly distributed, independent 
samples from the distribution and its accuracy is determined by the 
number of these samples. However, this approach is still compu- 
tationally expensive and much time can be spent in regions where 
the probability is very small. Instead, MCMC is more efficient as 
it exploits the fact that samples need not be drawn independently if 
they are generated from the target distribution or some function of 
it, here P(j3mod|Pobs, I), in the correct proportions. 

The Metropolis-Hastings algorithm is used to apply the 
method. This algorithm generates a sample set by constructing a 
walk through parameter space in which the probability of a sam- 
ple's being in some region of space is proportional to the posterior 
density for that region. It does this by determining the next chain 
step, which in this case is Pmodt+i, with respect to its probability 
given the current chain step, Pmodj , through the evaluation of the 
transition probability (or kernel), P(pmodt+i |Pmodt ). 



The algorithm chooses a candidate value for Pn 



Pcan, 



from a proposal distribution, Q(pcan|Pmodf ) that is understood 
and easy to evaluate (see Section l3.4.3t . pcan is accepted or re- 
jected as Pmodt+i as determined by the acceptance probability, 
o(Pmodt ,Pcan) which Can bc expressed as 



— min(l,r) (6) 

— miri ( 1 -P(?'<:an|-D, /) Q(Pmodt |Pcan) 



Q(Pmodt,Pc 



-P(Pmodt l-D, /) Q(Pcan|Pmodt), 

where r is the Metropolis ratio. If r > 1, then pcan is accepted and 
Pmodt+i = Pcan. If r < 1, then a random variable U is sampled 
from a uniform distribution in the interval to 1. In the case that 
U < r, Pmodj+i ~ Pcan, othcrwlsc pcan is rejected. The transition 
kernel, the probability that the algorithm will draw and accept a 
sample Pmodt+i given the chain's present state, is then 

-P(Pmodt+i IPmodJ = (5(Pmodt+i|Pmodt)Q(Pmodt,Pcan) (7) 

As a first consideration, we might want the proposal distribution, 
(3(Pcan|Pmodt), to be the target distribution itself - but of course, 
this is unknown as it is this that we are trying to evaluate. However, 
if the Markov chain is irreducible, aperiodic and positive recurrent 



then it can be shown that there exists a stationary distribution from 
which all samples will be drawn subsequently once one initial sam- 
ple is drawn; thus the algorithm will converge on this stationary 
distribution for a wide range of proposal distributions. 

The probability of drawing pmodt from the posterior is 
P(Pmodt 1^5, /) and the probability that Pmodt+i is subsequently 
drawn and accepted is given by the joint probability of Pmodt and 
Pmodt+i: P(Pmodt,Pmodt+i), wMch Can be expressed as the fol- 
lowing: 

-P(Pmodt,Pmodt+i) = -P(Pmodtl-D,/)P(p„iodt+i|Pmodt) (8) 

Expanding this out using equations ([6j and Q, gives the detailed 
balance equation: 



P(p„,od,|£',/)P(p„ 



tl\Pn 



= P(Pmodt + i \D, /)P(pmodt IPmodt 
(9) 

From this it can be seen that the stationary distribution is the target 
distribution of the chain, irrespective of the proposal distribution - 
(5(Pcan|Pmodt) - initially used. The sampling process before the 
stationary distribution is reached is referred to as bum-in. 

The algorithm allows the chain to move to regions of increas- 
ing probability while sometimes accepting a chain step of lower 
probability. This contributes to the efficient exploration of parame- 
ter space as the chain can move away from regions of local maxima. 
However, for a target posterior distribution of a multi-dimensional 
problem this flexibility will not necessarily be sufficient to prevent 
the chain becoming stuck in a local maximum region; the results 
will still be dependent on the starting sample. In practice we use 
multiple chains with information exchange (see the next section) 
but we also make use of a simulated annealing method that allows 
a modified posterior distribution to be sampled during bum-in. 

This modified posterior distribution is defined as 



p(Pobs|-D, /) = p(Pobs!/)p(£'|Pobs, 



(10) 



where A may take values between and 1. When A = the sam- 
ples are drawn just from the prior distribution, which will typically 
be much flatter than the likelihood function - if this is the case 
increasing A draws samples from an increasingly peaked function 
until the posterior itself is being sampled when A = 1. Sampling 
from the flatter distributions gives the sampler better opportunity to 
reach all regions of the posterior probability distribution, even in 
the presence of many local maxima. The general practice is to de- 
termine the rate at which A is increased from to 1 (the annealing 
schedule) such that it will correspond to the bum-in period. In this 
scheme, once A = 1 sampling is being made from the target distri- 
bution. The rate at which A should be increased is then subjective 
and determined by experiment. 



3.4.3 Implementation 

Our implementation of the Metropolis-Hastings algorithm is heav- 
ily based on the approach of Hobson & Baldwin (2004), as imple- 
mented in the METRO sampler code, kindly provided to us by Mike 
Hobson. We implemented the basic algorithm in modular C using 
the Message Passing Interface (MPI) framework to allow it to run 
on a cluster of multi-core computers; only the functions that im- 
plement the likelihood function and priors need be modified for a 
particular problem. At run-time the code separates into one master 
and one or more slave threads; in general it is advantageous to have 
as many slave threads as there are available CPUs. 

Each slave carries out a separate and (except as discussed be- 
low) independent Metropolis-Hastings run with bum-in (a 'chain'). 
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starting in a random point in the prior parameter space. As dis- 
cussed by Hobson Sl Baldwin, multi-threading of this kind greatly 
reduces the chances of getting stuck in a local minimum in param- 
eter space; it also allows us to speed up by a factor of the number 
of available computing cores the sampling of the posterior proba- 
bility distribution. We borrow from the METRO code the concept of 
'engines', which are different versions of the proposal distribution 
Q. Several engines are available, and the code picks the one to use 
for each trial at random, using probabilities that we have assigned 
based on our experience with the code. The active engines in the 
implementation of the code used here are 'take a random step in a 
single dimension of the problem', 'take a random jump in all di- 
mensions of the problem simultaneously', 'jump to a random point 
in parameter space' and 'jump in one dimension to the current po- 
sition of another thread' (allowing cross-mixing). The sizes of the 
random steps used in the first two engines are adaptively chosen 
during burn-in to achieve a reasonable ratio of success to failure, 
but are fixed thereafter. 

The master thread records the accepted samples of all the slave 
threads in a file, and also co-ordinates synchronization and com- 
munication between the slave threads. The final result when all 
slave threads are completed is a large file giving the co-ordinates 
in parameter space of all accepted samples both before and after 
burn-in. The density of points in a given (necessarily finite) region 
of n-dimensional parameter space is proportional to the posterior 
probability of the model parameters lying in that regions. Estimated 
value determination, credible interval estimates, plotting and evi- 
dence determination can then all be carried out using this file. The 
burn-in points are discarded for most applications, but are used for 
evidence determination, as discussed below. 



3.4.4 Model parameters and priors 

Our basic model for jets and cores (hereafter 'the basic model') has 
up to 4 parameters. 

(i) 7, the mean bulk Lorentz factor. We adopt a uniform (un- 
informative) prior in the range 1..7max for this quantity. 7max 
throughout our fitting is taken to be 5.5 for jets and 20.0 for cores. 

(ii) Pint, the mean intrinsic prominence. As this is a scale param- 
eter, we adopt a uniform prior in In(pint) between two values cho- 
sen to cover all reasonable values of parameter space. This avoids 
bias towards large values (Gregory 2005). 

(iii) a-y, the intrinsic dispersion in the bulk Lorentz factor. 
Lorentz factors of simulated sources are drawn from a normal dis- 
tribution with mean 7 and standard deviation 70-^, truncated so that 
7 > 1 in all cases. The case cF-^ — is equivalent to a delta function 
in Lorentz factor. We adopt a uniform prior for a-, between and 
0.6. 

(iv) (Jpjj^j , the intrinsic dispersion in the prominence. Since the 
observed prominence will be the product of a number of inde- 
pendent variables, it is appropriate to draw simulated prominences 
from a lognormal distribution with mean In(pint) and standard de- 
viation (Tpjjjj ; the case o"pi„t = is equivalent to a delta function in 
intrinsic prominence. We adopt a uniform prior for crpi„t between 
and 6. 

For cores, we also investigated a model (hereafter 'the power- 
law model', as discussed in Section D.4.U in which the Lorentz 
factor follows a power-law distribution between two limits, 7min 
and 7max, with a power-law index a, i.e. P(^) oc 7"". For these 
parameters we adopted uniform priors between 1 and 5 for 7min, 
between 10 and 40 for 7max, and between and 5 for p. Since 



the mean intrinsic prominence pint is a required component of this 
model, and a dispersion in the intrinsic prominence, parametrized 
as above by cTpj^^^ , may also be considered, the model has up to 5 
parameters. 

3.4.5 Likelihood calculation 

The likelihood is the probability of obtaining a given set of in- 
trinsic prominences, pobs, given the model and the priors, i.e, 
rifc ^(Pobsfc Ipmod, I). In general it is difficult to write down the 
likelihood function for the type of models discussed in Section 
13.4.41 anal vticallv. We therefore proceed by Monte Carlo methods. 
For the set of model parameters determined by the Metropolis- 
Hastings algorithm, we simulate a set of model prominences 
based on the beaming equations (equations[3]and|4l( and the disper- 
sions discussed in Section [3.4.41 and with an appropriate distribu- 
tion of the angle to the line of sight 6 (in practice we assume sources 
randomly oriented to the line of sight in all models, so 6 values are 
drawn from a uniform distribution in cos 9). By construction, these 
simulated prominences are distributed with the appropriate prob- 
ability distribution for the model being tested. The probability of 
obtaining any given observed value of the prominence, Pohs^. , with 
an associated error, assumed Gaussian, of Op^^^^ given the model 
is then given by 



-P(PobSfc|Pmod,/) 



i=i L 



(Pob 



■ Pmodi)^ 



(11) 



Essentially here we are Monte Carlo integrating over the product 
of the probability distribution for the data point and the probability 
distribution for the given model, with a suitable normalization. In 
the case where Pobs is an upper limit, we could simply write 



1 ^ 

P{PohSk IPmod, ^) ^ J^^^ KPobSfc ,Pmo. 



where 



l{a,b) = 



1, h<a 

0, otherwise 



(12) 



(13) 



In practice we slightly 'soften' the treatment of limits in the case 
where pobs^ < Pmod^ to take into account that no limit is absolute; 
thus model data points with Pmod; > Pobs^. are assigned a non- 
zero probability, again based on a normal distribution, so as to treat 
the limits as though they were So" upper limits. This approach is 
exactly valid for the limits on core prominences and an adequate 
approximation for the less well-defined jet limits. 

A two- to five- dimensional parameter space is defined by 
the possible parameters given in Section [3.4.41 The procedure de- 
scribed above effectively integrates over 9 and the posterior prob- 
ability represents a joint posterior probability for some or all of 
Pint, cTpint, 7, a"7> 7min, 7max or fl, depending on how the parame- 
ter space has been defined. We are more interested in the posterior 
probability of some of these parameters than of others: in particu- 
lar the one of most physical interest is 7. The marginal posterior 
probability of 7, or in general of any other parameter, can be deter- 
mined from the joint posterior probability by integrating over the 
other parameters: 



P(7|pobB,Af) = J dXP{j,X\pobs,M) 



(14) 



where X represents all parameters except 7. This integration can 
be carried out trivially using the output of the MCMC routine. 
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3.4.6 The evidence and the Bayes factor 

The denominator of equation ([5} is the 'evidence' and is calculated 
from the following. 

P(pobs|/) = P(Pmod, |/)P(PobBbmod, , /) (15) 

i 

Since this is the same for each Pmodi for a given model, M, it may 
be omitted in calculating the posterior probability of Pmod; (equa- 
tion [5j. However, if we want to compare models, for example a 
model in which we have 2 parameters, 7 and pint, with one where 
we have four, 7, pint, cr^ and cTpj^t , the evidence must be evaluated. 
This is not possible using only the post burn-in samples from the 
posterior, as these are, by definition, not uniformly sampled over 
the prior. Instead, it is possible to use the bum-in samples them- 
selves, as justified by the following. 

The continuous equivalent of equation l ll5t gives us the global 
likelihood for M, that is the weighted average likelihood for its 
parameters. 

P(Pob.|M)= j dXP{X\I)P(p,^,\X,M) (16) 

where X represents all parameters. 

Remembering that the function actually being sampled by the 
code is a modification of P(X |pobs, I), (equation [10} , we can de- 
fine a partition function, Z{X) as 



Z{X) 



dXP(X\M,I)P{p,^,\X,M,lY 



(17) 



dX Gxp{ln[P(X|M, I)] + A ln[P(pobs|X, M, /)]} 

(18) 

Z[\) is then the tempering simulation corresponding to A, and in- 
tegrating over all A gives us the function we want, the global like- 
lihood, 



P{Po 



\M) = r 

Jo 



= / dlnZ(A) 



(19) 



But, from equation dlSb with some rearrangement, we can write the 
derivative of ln[Z(A)] as 

d 



dX 



ln[Z{X)] = {\n[P{pob,\M,X,I)])x 



(20) 



where {ln[P(pobs| Af, X, J)])a is the expectation value of 
ln[P(pobs|A^, X, I)] and the subscript A denotes which tempering 
simulation the samples correspond to. But we can also say that 



ln[P{pob,\X,M,I) = J X{ln[P{pobs\X,M,I)])x (21) 
--yZ A,(ln[P(pobB|X, M, I)])x, (22) 

i 

So for a given model, the log global likelihood can be obtained from 
the bum-in samples and this, along with the model prior, can be 
used to evaluate the odds ratio, defined as the ratio of the evidence 
values for the two models: 



Oi.: 



P(Mi|7) P(pobs|Mi,J) 

P(M2|7) P(pobs|M2,/) 



(23) 



In this analysis identical priors are applied to the various models 
tested, so this expression is simplified to the ratio of global likeli- 
hoods, known as the Bayes factor. We can then use the odds ratio 
or, equivalently, the Bayes factor, to attempt to say which of two 
models provides the best description of the data. 



3.4.7 Credible intervals and regions 

We define the credible interval on a parameter for a confidence level 
p as the smallest interval such that the posterior probability of the 
parameter lying in the interval is p. In one dimension (i.e. integrat- 
ing over all other parameters, which in the context of the output of 
the code simply means ignoring their values) there is clearly, pro- 
vided the posterior probability has a single peak, a unique choice 
of interval that satisfies this requirement, which can be found by 
an exhaustive search over the results of the sampler, and which is 
accurate up to the constraints imposed by the finite sampling of 
the posterior. In more than one dimension, there is no such obvi- 
ous choice, and we find an approximation to the credible region 
by binning the posterior probability distribution (marginalizing out 
uninteresting parameters), sorting by the probability of each binned 
element, and taking the first m elements that sum to give the prob- 
ability p: the credible region is then approximately the region of 
parameter space enclosing these m grid elements. This procedure 
has the advantage that if there truly are multiple peaks in the poste- 
rior they will be represented correctly: it has the disadvantage that 
the results may depend on the binning and, in particular, that the 
binning may have to be quite coarse in order to define a credible re- 
gion in many dimensions. In what follows we only present credible 
intervals in one dimension and credible regions in two dimensions. 

We note in passing that this definition of the credible inter- 
val can (for a very asymmetrical posterior) actually exclude the 
position of the Bayesian estimate of the parameter, since it corre- 
sponds to the mode, while the Bayesian estimate corresponds to the 
mean; the credible interval is certainly not constrained to lie sym- 
metrically about the mean (in fact it is more likely to be symmet- 
rical about the maximum-likelihood value). This should be home 
in mind when interpreting the 'errors' that we quote on certain pa- 
rameters. 



3.5 Method verification 

3.5.1 Data simulation 

Before running the code on real datasets it is necessary to estab- 
lish whether the Bayesian estimator of the parameters of interest 
provided by the code (i.e. the mean of the values of that parameter 
for all post bum-in samples) is really an unbiased estimate of the 
true value under realistic conditions and whether the uncertainties 
(credible intervals) on parameters or combinations of parameters 
are good estimates of the true uncertainty. 

This can be done using Monte Carlo simulations (as H99 did 
for their K-S test model fitting). To do this we simulated a large 
number (50) of sets of jet data, with parameters chosen to represent 
a best guess at what resembles the real data most closely. A sim- 
ulated dataset contained 100 prominence points, each with Pobs 
simulated using equation l|3j. 

Initially Pint^ was drawn from a Gaussian distribution with 
In(pint) ~ -~7,<^Pint ~ 0-6 and we used 7 = 1.5, = 0. For 
30 per cent of sources Pobs was not used as calculated but instead 
an upper limit was generated. To generate the upper limits we used 
a random number between 0. 1 and the calculated jet prominence 
value. While the non-detection of a jet might be expected to be 
dependent on Pobs to some extent, when we considered the jet 
prominences and upper limits for the sample as a whole (Paper 
I) we found that there was no discernible trend with prominence 
with respect to upper limits. It would appear that observational ef- 
fects, particularly the variations in observing resolution, mask any 
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strong dependence of jet detection on Pobs such that there is no ob- 
vious model that should be used for generating upper limits; hence 
we adopt a random deviate method. It is reasonable to expect that 
this method, along with the high maximum upper limit relative to 
the observed upper limit values used, will only provide simulated 
data of 'worse' quality than that which is observed, justifying the 
simplifying approach in this context. For the remaining sources, 
the jet prominence was used with an associated error. This error 
was drawn from a normal distribution where the mean was the cal- 
culated Pobs and the standard deviation was variable, itself taken 
from a normal distribution with (0,0.1 xpj). This standard deviation 
was chosen to simulate as closely as possible the error distribution 
of the real jet data. 

3.5.2 Parameter estimation 

For each simulated dataset we then ran the MCMC code and de- 
rived the posterior probability distribution, from which we could 
obtain the Bayesian estimators of the value of each parameter. The 
mean and dispersion of the values of (7), (In(pint)) and {(Jp^^^) 
for each run of the code - where the angle brackets denote the 
Bayesian estimates of the model parameters - then tell us whether 
the code recovers the true values and estimates the uncertainties ap- 
propriately. For the initial trial input values of 7 and ln(p) we ob- 
tained a mean (7) of 1.52 and a standard deviation of 0.14, a mean 
{In(pint)) of -7.01 with standard deviation 0.16 and a mean (cTp^^^) 
of 0.56 with standard deviation 0.16. We can see that while the er- 
rors and limits imposed on the data result in considerable scatter 
in the returned parameters about the true (input) values, the means 
are in good agreement with those true values once the scatter is 
taken into account. Thus there is no evidence that the code is bi- 
ased for this type of dataset. Moreover, the standard deviations we 
find (which are in some sense frequentist estimates of the uncertain- 
ties on the fitting procedure, and were used in that way by H99) are 
very similar in magnitude to the credible intervals inferred from 
the posterior probability distributions for individual simulations. 
We verified that the same result holds for a range of other values 
of 7- The fitting procedure is, unsurprisingly, biased if we fit the 
wrong model (e.g., if we fit a simulated dataset with intrinsic dis- 
persion with a model in which cTp^^^ is held at zero, then we will 
recover an artificially high value of 7) but for models that match 
the data it is not. This is true for all parameters of the basic model 
and also for the parameters of the power-law model for cores when 
we carried out appropriate simulations, although we noted in sim- 
ulating power-law distributions that it is very hard to recover good 
constraints on 7max, which is only constrained by the tail of the 
prominence distribution. 



3.5.3 Model selection 

We further investigated whether the Bayesian evidence could re- 
liably be used to distinguish between models. Again concentrat- 
ing on simulated datasets with set to zero and with properties 
matched to the observed jet data, we simulated a large number of 
datasets with apj^^ > and investigated whether the Bayes factor 
allowed us to distinguish between models in which (a) the fitted 
•^Pint fixed to zero and (b) it was allowed to vary. The results 
turned out to depend on the value of Op^^^ adopted in the simulated 
data, as one would expect. For very low values of the simulated 
CTpi^j , the Bayes factor on average favoured the simpler model, even 
though it is formally incorrect: the data here simply do not give us 



enough information to require the extra parameter. For larger values 
of the input (Jp^^^, > 0.2, the Bayes factor clearly tends to favour 
the models with non-zero intrinsic dispersion. However, when we 
fitted the same data with other versions of the basic model with 
intrinsic dispersion (e.g. models where o-p^^^ was constrained to be 
zero and a-, was allowed to vary) we found that the Bayes factor did 
not reliably distinguish between the correct model and other pos- 
sible models for the dispersion; all such models gave very similar 
Bayes factors. Consistent with this, we found that the Bayes fac- 
tor in simulations did not allow us to distinguish between versions 
of the basic model with Gaussian dispersion either in the intrinsic 
prominence or in the Lorentz factor and the power-law model for 
cores; again, the Bayes factor is sufficent to point to some disper- 
sion but not to say what its origin is. 



3.5.4 Obsen'ed data 

Finally we note that two factors distinguish the jet and core data. 
The core data contains far fewer upper limits than the jet data and 
so in one sense is expected to be easier to fit. However, as is evi- 
dent from the broader distribution of core prominences (Fig.|2j and 
also expected from earlier work (e.g. H99) we expect significantly 
larger 7 values in the fits to the cores. This means that the feature of 
the data that discriminates between different models becomes the 
extent of the tail of very high prominences. However, since there 
are only a very few sources with such high values, the fit becomes 
less reliable. In a fit to 50 simulated jet data sets with input 7 ~ 5.0 
and (Tpjj^j — 0.6, we found a mean fitted 7 of 5.3 with standard 
deviation 1.0. The higher standard deviation of the fit results to the 
simulated data as a fraction of the true 7, when compared to the jet 
data, indicates that the second factor dominates: even though the 
core data are in some sense better, the fitting problem is harder and 
the results likely to be more uncertain. 

We conclude that the MCMC fitting procedure can give a good 
estimate of the true underlying beaming parameters in the presence 
of the types of error and the numbers of upper limits seen in the 
real data. In the next section we proceed to use this procedure to 
estimate parameters using the actual measurements. 



4 RESULTS 
4.1 Jets 

Since the spectral indices of jets are poorly known, we carried out 
all the modelling in this section for two representative values of 
jet spectral index, a = 0.5 and a — 0.8 (recalculating the jet 
prominence distribution consistently for the a — 0.8 case from the 
original data of Paper I.) By using these two values, which are close 
to the extreme values observed for individual objects, we can both 
evaluate the extent to which a choice of a affects our results and, 
hopefully, bracket the true values of the beaming parameters for 
intermediate spectral index values. 

We initially used the simplest possible model, with an as- 
sumed delta-function distribution in both the intrinsic prominence 
and the bulk Lorentz factor, to estimate beaming parameters from 
the prominence distribution of the jets for all 98 objects in the sam- 
ple. We then allowed the parameters governing the intrinsic disper- 
sion in prominence and Lorentz factor to vary. Results are shown 
in Fig. [3] (for the a = 0.5 case) and tabulated in Table |2] (It should 
be noted that the posterior distribution of is clearly limited by 
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the prior: the upper limit on the prior distribution was chosen so as 
to avoid generating large numbers of negative Lorentz factors.) 

Comparison of the values for the different a values in Table 
[2] shows that, as expected, the choice of jet spectral index has a 
significant effect on the result. Although the two sets of results are 
very comparable, the results for the higher spectral index have sys- 
tematically lower estimated Lorentz factors. The next thing to note 
is that the Lorentz factor for a delta-function distribution in both 
parameters is quite high: 7 = 2.37 corresponds to f3 = 0.91, and 
7 = 2.07 to /3 = 0.88, which can be compared to the /3 = 0.62 
found by H99. However, when we allow any dispersion in either 
the intrinsic prominence or the Lorentz factor, the values of 7 drop 
very substantially. 7 — 1.28 corresponds to /3 = 0.62 and 1.18 to 
/3 = 0.53: H99 found f3 ~ 0.5 for a similar model. We emphasise 
that no model allows jet speeds with 7 ~ 10; this would predict a 
much larger dispersion than is seen in the data. The ratios of evi- 
dence (Bayes factors) very strongly favour any model with intrinsic 
dispersion, but does not allow us to distinguish between the three 
different models that have this property with any degree of confi- 
dence (as we would expect from the simulations discussed above). 
In fact, when we fitted a model with 7 fixed to 1.0 and a.^ = (i.e. 
the data are modelled with only a lognormal prominence distribu- 
tion) the Bayes factor marginally favours this simplest model over 
any other, showing that the prominence data in themselves provide 
no evidence for a beaming model, while intrinsic scatter in the data 
is required. 

Since there is some evidence that the low-excitation radio 
galaxies (LERGs) do not participate in the unified models for 
narrow-line radio galaxies, broad-line radio galaxies and quasars 
(see H99 for details) we also fitted the same set of models to the 
prominence data excluding the 15 low-excitation objects in the 
sample. The very similar results we obtained are tabulated in Ta- 
ble [2] As the inclusion or exclusion of the small number of LERGs 
clearly makes little difference to the results, we do not show plots 
of the posterior probability distribution. 

4.2 Luminosity dependence in jets 

As mentioned above (Section [4. It . our best estimates of 7 for jet 
models either with or without intrinsic dispersion tend to lie above 
the corresponding values found by H99. To some extent this is ex- 
pected, since (i) we take limits and errors into account properly, 
which H99's method did not permit, and a correct treatment of the 
limits at least would be expected to broaden the effective distri- 
bution, and (ii) we are quoting not the maximum-likelihood value 
but the Bayesian estimator of the Lorentz factor, J 7p(7)d7, which 
will be biased towards higher values with respect to the maximum- 
likelihood estimator if p(7) has a long tail to higher values, as it 
undoubtedly does in some fits (Fig. [3}. In addition, H99 used a 
different definition of prominence, normalizing with respect to the 
total high-frequency flux, which might have tended to reduce the 
scatter in the observed prominence if any of the high-frequency 
structure (e.g. hotspots) had been affected by beaming. 

However, it is worth asking whether any component of this 
difference could correspond to a real physical difference between 
our sample and that of H99. The two samples overlap to some ex- 
tent - of the 3 1 NLRG studied by H99, 22 are also in our sample 
- but the main difference is that H99 considered only objects with 
z < 0.3, and hence largely with low luminosities. We began our in- 
vestigation by fitting our models with (Jp.^^ = and cr.^ = to the 
jet data of H99, for consistency treating upper limits as detections 
and assuming a constant fractional error on each prominence mea- 



surement. For consistency with the convention adopted by H99, we 
use a — 0.8 throughout this comparison. We found a Bayesian es- 
timate of 7 = 1.65^0 (/3 = 0.80), which is considerably larger 
than the best-fitting value found by H99, /3 — 0.62. This suggests 
that there is some difference in the different fitting approaches, and 
perhaps reinforces the notion that H99's approach of fitting using 
the K-S statistic was not ideal. If we allow crpi„t to vary in the fits, 
which was the most sophisticated model considered by H99, we 
obtain 7 — I.IOIq q^; this corresponds lo f3 — 0.43, which is very 
similar to the f3 ~ 0.5 reported by H99. 

Although this is at best qualified success in reproducing the 
results of H99, we noted that the values of 7 for the two models we 
fitted lay systematically below the results we obtained for fitting 
the same models to our sample as a whole (Section [4. II Table |2l(. 
This motivated us to look for luminosity effects in the jet speeds in 
our sample. For simplicity we divided the whole sample, including 
LERGs, at a 178-MHz luminosity of 5 x lO^*' W Hz"^ sr"\ which 
is the cutoff luminosity we used in a number of statistical tests in 
Paper I. This gives a sample of 40 low-luminosity and 58 high- 
luminosity objects. When we fitted the model with (Jp^^^ = and 
— Oto these two datasets (here using a — 0.5 only), we found 
that 7 = 2.37^0.11 for the low-luminosity sample and 1.92l^;j;^ 
for the high-luminosity one, a marginally significant difference but 
one that is in the opposite sense to that implied by the results above 
from the H99 data. When we fitted the model with a-, — but o"pi„t 
free, we found 7 = 1.25io'2^ for the low-luminosity sample and 
1.55+ 

0.14 fo'' high-luminosity one. However, the intrinsic dis- 
persion for the low-luminosity sample is higher (crpj^^j = 1.041qi2 
versus 0.65^0 17) so it is plausible that we are simply seeing differ- 
ent tradeoffs in the somewhat degenerate plane of 7 versus o-p.^^ . 
If, finally, we fix Up.^^ to its best-fitting value of 0.95 for the whole 
sample (Table |2j and repeat the fits, the estimates of 7 become 
1.291q 5g and 1.33lQ j^g respectively; these are not significantly 
different from each other or from the estimate for the sample as a 
whole. We conclude that there is no convincing evidence for a trend 
in 7j with source luminosity. 

4.3 Cores 

Using the basic model, we find that, for a delta-function distribu- 
tion in Lorentz factor and intrinsic prominence, pint and 7 are con- 
strained to lie along a line in parameter space, as found in previous 
work (see Hardcastle et al. 2003 for the equation of this line) and 
we can only really say that 7^10 (the limit 7 < 20 is imposed by 
the prior). However, when any intrinsic dispersion is introduced, 
7 becomes reasonably well constrained, with Bayesian estimators 
in the range 10-14, though again this is affected by the choice of 
prior. As in the case of the jets, the Bayes factors for these mod- 
els strongly favour some intrinsic dispersion, but we cannot distin- 
guish between different dispersion models. Models with no beam- 
ing and only intrinsic dispersion are not favoured in this dataset, 
presumably because the tail of the prominence distribution is not 
well modelled with a lognormal distribution. 

Again, we also fitted the same set of models to the subsample 
that excludes the LERGs. Here the tendency is to reduce the values 
of 7 required - perhaps because the LERGs include a significant 
number of both core non-detections with strong upper limits and 
bright cores with high prominence values. The results of fitting the 
basic model to both datasets are tabulated in Table[3] 

We then estimated the parameters of the power-law model 
for the core prominence distribution. These results are tabulated 
in Table |4] We initially fixed o-p^^.^ to 0, so that all the dispersion 
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7 7 

Figure 3. The posterior probability distribution for fits to the jet prominence distribution for the whole sample. Contours show the smallest region that contains 
68 per cent of the posterior probability distribution, i.e. the two-dimensional Bayeian credible region as defined in Section |3. 4. 7| The white cross marks the 
position of the Bayesian estimator of pi^t and 7. Top left: a delta-function distribution is assumed for both the intrinsic prominence and the Lorentz factor (i.e. 

= 0, CTp.^^ = 0). Top right: cr-, = but (Tpj^j is allowed to vary. Bottom left: 0"^.^^ = but cr-, is allowed to vary. Bottom right: both 0"^.^^^ and c-y are 
allowed to vary. All plots are marginalized over (Tpj^^^ and for ease of comparison. 



comes from the power-law distribution of 7; with this prior we ob- 
tain some constraints on the remaining parameters of the model, 
though they are still strongly affected by the choice of prior (partic- 
ularly 7max, which we expect from the simulated data to be un- 
constrained). We attempted a fit with a fixed a value of 2.0, as 
favoured by some previous work (e.g. Liu & Zhang 2007) but this 
is strongly disfavoured by the Bayes factor. When (Jp-^^^ is free, the 
parameters of the power-law model are essentially unconstrained, 
although the Bayes factor favours such a model. It can be seen that, 
as expected from the simulations, these data give us no reason to 



favour a power-law distribution of Lorentz factors over a lognor- 
mal one; the very slight difference in Bayes factors in favour of the 
power-law model when Op^^^ is free is not significant. 

4.4 Luminosity dependence in cores 

In Paper I, we suggested that the statistically significant differ- 
ence in core prominence seen in our low-luminosity and high- 
luminosity samples might be evidence for a dependence of Lorentz 
factor on source luminosity. To investigate this we divided the 
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Table 2. Results of fits to the jet prominence distribution. Fits are carried out for two values of the assumed jet spectral index, a. The Bayes factor quoted is 



the natural log of the ratio between the evidence value for the a.y 



0, 



fit (for a given sample and a) and the others; thus it gives a measure of 



the degree to which the other fits are preferred. EiTors quoted are 68 per cent credible intervals marginalizing over all other parameters. Where no en'ors are 
quoted, the quantity concerned was fixed at the stated level. 



Sample 


a Model 7 


In(Pint) 






Bayes factor 


Jets, all sources 


0.5 a-y = 0, = 2.37l^'^? 
'^■y = 0. ^P,nt free l-28t»;i; 

free, = ^-i^tVir 
a-y free, <Jp^^^ free 1.27^q ]^| 
7 = 1.0, (Ty = 0, (Tpjjjj free 1.0 


-8.42i;i;* 
-8.29in^ 

-».2U_o og 

_7 qi+0.11 
'•'^^-o.io 




q^+O lO 
'^■^3-0. 09 



76+015 
'J- '0-0.13 

1 10+0.05 
^■J^°-0.06 








19.4 
20.0 
19.5 
21.0 



0.8 = 0, o-pi„t = 

^1 ~ ^' '^Pint free 

cTy free, fJp.^^^ = 
free, (Tpj^j free 
7 = 1.0, = 0, (Tp 



2.07 
1.18 
1.24 
1.16 



free 



Jets, LERGs excluded 0.5 cr-y = 0, crp.^ 
(Tt = 0, o-„.„ 



= 
free 
= 



(T'^i free, fJp.^ 
Tee, o-pjj^ 
1.0, cr-y = 0, CTp.^^j free 



2.43 
1.32 
1.51 
1.31 



0.06 
0.05 
0.07 
+0.09 
0.15 
+0.04 
-0.16 
1.0 
~U.U4 



-8.50 
-8.36 
-8.18 
-8.25 
-7.90 



0.14 
+0.11 

0.12 
+0.14 

0.19 
+0.15 

0.19 

1.0 



-8.42 
-8.35 
-8.12 
-8.24 
-7.97" 



+U.05 
0.09 
+0.08 
0.03 
+0.10 
0.10 
+0.06 
0.07 
+0.11 
0.09 
+U.UV 
0.13 
+0.07 
0.06 
+0.12 
0.14 
+0.06 
0.07 





fO.08 
-0.07 



^+0.19 
-0.15 
1 11+0.04 



0.94^ 



0.64^ 



0.45 
0.32 






+0.11 

0.06 
+0.15 
0.13 




+0.07 
0.05 



0.94 

0.73 
1.19 





+0.09 
0.11 


+0.18 
0.13 
+0.05 
0.05 






46+"" 
0-^0-0.04 

'^■'^^-0.14 




19.2 
20.2 
19.3 
21.0 



14.4 
15.6 
14.5 
15.8 



0.8 


o"7 = 0. 0"Pi„t = 


9 11+U.ULi 
^•-^-^-0.08 


-8.50i;i;* 












= 0, Cpjjjj free 


1 91 +0.06 
^■^^-0.10 


qO + 0.06 

-»-3»_o,o6 





97+0 " 
"■'"-0.07 


15.6 




free, <Tp^^^ = 









45+013 
'^■^^-0.06 


15.4 




aj free, (Tpj^^j free 


1 9r,+0.09 
^^■^'^-0.16 




O-OJ^-0.17 


O-'5*-0.12 


14.0 




7 = 1.0, 0-7 = 0, cTpjj^j free 


1.0 


_7 94+0.05 
' ■='^-0.06 


1 1 C-+0.03 





15.5 



sample of core prominences by luminosity in the same way as 
was done for jets in Section 14.21 Fitting only the version of the 
basic model in which (Jp^^^ is free, since, as discussed above, 
only models with some intrinsic dispersion give reasonably well- 
constrained 7 values, we found 7 = 11.20i^:^^ for the low- 
luminosity sources and 10.67^^43 for the high-luminosity sources. 
Thus there is no evidence for differences in Lorentz factor, al- 
though clearly 7 is poorly constrained. The intrinsic normaliza- 
tion values for the low- and high-luminosity sample are respec- 
tively — 4.70+J'55 and — 6.06+Q g4, while the estimates of (Tpj^^j are 
1.40lg'24 1-19-0 23- The most obvious interpretation of these 
results is that, if any luminosity dependence is present in our data, 
it is a luminosity dependence of intrinsic prominence rather than of 
bulk Lorentz factor; either explanation is of course equally good in 
terms of reproducing our original observation of a difference in ob- 
served prominence. Given these results and the fact that in any case 
the data do not appear to allow us to distinguish between any but 
the simplest of models for core Lorentz factor distribution, we have 
not attempted to fit a model in which core Lorentz factor depends 
on source luminosity to the data. 

4.5 Core-jet correlation 

Fits to the jet and core prominence distributions alone do not take 
into account the correlation between the two quantities. In a model 
in which the prominence distributions were completely dominated 
by intrinsic scatter, we would expect no correlation between these 
two quantities, but in fact we showed in Paper I that there is such a 
correlation which is significant even in the presence of upper limits. 
It is straightforward to modify our approach to simulate the two- 
dimensional probability distribution of jet and core prominences 
and to sum the likelihoods of objects in (jet, core) prominence 
space. Accordingly we carried out some fits using this approach, 
fitting, as before, both to the full database and to the subset of 
sources that excludes the LERGs. Since we have seen that intrinsic 



dispersion is required in the fits to the two quantities individually, 
we included this in our two-dimensional models; for simplicity we 
restricted ourselves to a model in which there are intrinsic disper- 
sions in the prominences but not in the beaming speeds, so that the 
free parameters of the model are pint^, cTp , 7c, Pint , o"p. and 

^ m c J 111 j 

7j- 

Results of these fits are tabulated in Table |5] again allowing 
the jet spectral index to be either 0.5 or 0.8, and an example of 
the posterior probability distribution for the beaming parameters is 
shown in Fig. [5] It can be seen that beaming is strongly required by 
the Bayes factor between these models and the corresponding ones 
in which the Lorentz factors are fixed to 1.0, as we might expect. 
The effect of including the core-jet correlation in the modelling is to 
make less probable the regions of parameter space with no beaming 
(compare the right-hand panel of Fig. |5] with the top right-hand 
panel of Fig.[3}. The Bayesian estimates of the beaming speeds are 
therefore slightly higher than in the case where the correlation is 
not taken into account (7j = 1.35 corresponds to /3j = 0.67, and 
7j — 1.23 to /3j ~ 0.58) although there is substantial overlap in the 
credible intervals. Once again, there is little difference between the 
results for all sources and the results that exclude the LERGs. 



5 DISCUSSION AND CONCLUSIONS 

The models for the prominences of both jets and cores are a func- 
tion of three variables, (pint 7, 0), so that the problem of fitting to 
the 7 distribution is degenerate. However, as discussed in Section 
13.4. II prominence distributions simulated using these these models 
do represent the sample data well, and are strongly influenced by 
the value of 7. The model fitting that we have performed in this pa- 
per has allowed the upper limits in the data to be treated correctly 
and so the following conclusions can be drawn about the probable 
range of 7 in the jet and core data. 

For both jets and cores, it was found that models allowing 
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Figure 4. The posterior probability distribution for fits to the core prominence distribution. Labelling as for Fig. [5] Top left: a delta-function distribution is 



assumed for both the intrinsic prominence and the Lorentz factor (i.e. cr-y 



0, 



0). Top right: cr-y = but o"p;„t is allowed to vary. Bottom left: 



but (T^ is allowed to vary. Bottom right: both cr^^^^ and a.^ are allowed to vary. All plots are marginalized over Op^^^ and a^y for ease of comparison. 



Table 3. Results of fits of the basic model to the core prominence distribution. Conventions as for Table|2] 



Model 

= 0, (Tpi^j = 



Sample 



7 

+'i.m 

4.71 
+3.15 
-.54 
+2.58 
2.44 
+3.03 
5.91 

1.0 



ln{pint) 

5r 

18 

63 
68 



Bayes factor 



Cores, all sources 



= 0, o-pi„t free 
CT-y free, o^pl^^ = 
free, Upj^t free 

7 = 1.0, (T-y 



0. f^Pint free 



14, 
10. 
14. 
10. 



0.28 
1.31 
0.72 
+0.40 
0.24 
+ 1.02 
0.84 
+0.11 
0.12 



1.43 



1.23 
2.15 





+0.12 
0.13 


+0.21 
0.17 
+0.09 
0.11 






n 57+0.03 
'^■^'-0.01 

U.OO_Q Qg 




33.6 
32.9 
33.7 
32.7 



Cores, LERGs excluded aj = 0, ffpi^t = 


1 1 Qfi"'"'^' 

il.yD_g gg 


-6 69+^-^' 

"•"^-0.53 










(Ty = 0, (Tpi^^ free 


8.2412-35 


-6.281?;^^ 


1 O1+0.11 

-^••^-^-0.16 





27.8 


(T-y free, <Tp^^^ = 


12.66l|;^^ 


t; co+0.91 
^•^^^-0.28 





n 55+0-05 


27.9 


a~/ free, cp.^j free 


8.8412.49 


-6.131-;^ 


1 09+0-22 
i.uy_Q 21 


34+0.26 


28.0 


7 = 1.0, cr-, = 0, (Tp.^^j free 


1.0 




9 no+o.oo 

^•"^-0.08 





27.8 
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Table 4. Results of fits of the power-law model to the core prominence distribution. Conventions as for Table |2] The Bayes factor is with respect to the 
corresponding model with = 0, o-p^^^ = in Table[3] 



Sample 


Model 


In(pint) 


0"Pint 


7miii 


Tmax 


a Bayes factor 


Cores, all sources 


0"pj 


nt 


= 0, a free 


-6 10+"'^* 

D-J^U_g 23 





^■-^-^-0.25 


30.741-- 


'-'■^0-0.15 


31.1 




Cp- 


nt 


= 0, a = 2 


o -,9+0.76 
■^•^^-0.26 





2 49+0 *9 


29 46+i0-^* 


2 


24.0 




0"pj 


nt 


free, a free 


'-0.35 


1 40+°-2i 

^■^'-'-0.23 


q 77+1.17 
''■ ' ' -0.38 


24.24^5^29^ 


2 eo+^o^ 

Z.DU_j j^Q 


35.0 


Cores, LERGs excluded 


0"pj 


nt 


= 0, a free 


"•'^-0.34 





3 47+U « 

■^■^'-0.69 


26.56+^;!;^ 


'J ■^'J- 0.42 


26.9 




O'p- 


nt 


= 0, a = 2 


7 n7+0-20 
' -0.26 





•-'■="^-0.53 


28.12ljigf 


2 


24.1 




Cp- 


nt 


free, a free 


-7 in+0-48 


-, r,-|+0.25 

-^■^-^-0.27 


o 44+0.78 
■^■^^-0.89 


24.35t5f9g 


^■O'J_0.85 


28.8 



0.5 and a = 0.8. Conventions as 



Table 5. Results of joint fits to the jet and core prominence distributions for the whole sample for jet spectral indices a 
for Table|2] but the Bayes factor is measured with respect to the model with free beaming parameters. 

Sample a Model > In(pintc) '^Pintc In(pintj) <^Pint- Bayes factor 

TT-ra YTnTTrn 1 ,7+U.Uti _8.29+"-^^ n Q7+'t-U'i ~" 



7c 
+b.3V 
4.74 

1.0 

+6.57 
-3.55 
1.0 

+y.5B 



7j 

■m- 

.12 

1.0 

.03 
.08 

1.0 



AU sources 



0.5 All free 11.81 

No beaming 
0.8 All free 11.76 

No beaming 



-5.17 
-5.18 



+ 1.10 



1.49"* 



1.23^ 



-8.36"* 



0.97 



0.98^ 



+ 1.55 



= +0.13 



+0.13 

-0.13 



-U.06 



-10.0 



-8.0 



LERGs excluded 0.5 



0.8 



All free 
No beaming 
All free 
No beaming 



10.49 



9.48 



7.29 

1.0 

+3.27 
6.30 
1.0 



-5.61 
-5.85 



1.393 



1.20] 



.13 
1.0 
0.02 
0.05 
1.0 



-8.39 



1.07 



;.4i'' 



1.05 



+0.07 
-0.08 



-4.0 
-4.2 



some dispersion in one or both of the intrinsic distributions of pint 
and 7 are favoured over the model that assumes a delta-function 
distribution for both parameters. For the jets, the value of 7 found 
for these models varied between 1.18 and 1.49 (see Table |2l(, de- 
pending on the model and the choice of a, which corresponds to 
/3 ~ 0.53 — 0.74 and is in reasonable agreement with previously 
reported analysis, as discussed in Section fTTTI Models with 7 10 
did not fit well to the data and there was no evidence for any lu- 
minosity dependence of 7. For the cores, a reasonably well con- 
strained 7 of ~ 10-14 was obtained in models in which intrinsic 
dispersion in the prominence was allowed, but the results here are 
dependent on our choice of prior. 

One obvious criticism of the core data modeling is that it has 
been assumed that the properties of the twin parsec-scale jets are 
the same and that the jets are bi-polar. More complicated models 
could be defined that allow the pint and 7 values to be different 
in the approaching and receding jet, and allow the separation be- 
tween the jets to be less than 180° - even if the real parsec-scale 
jets properties are similar or close to being symmetrical in the two 
beams, it is entirely plausible there is at least some deviation from 
exact symmetry. Experimenting with such models, however, deter- 
mined that the data does not justify models any more complex than 
those reported in the preceding sections and Table |2] That is, the 
evidence values associated with the models with greater numbers 
of parameters are not higher and fitting results do not differ greatly 
from those obtained with the simpler models. Considerably larger 
datasets will be needed to investigate more complex models. 

We also carried out fits to the combined jet and core promi- 
nence datasets, as evidence for a jet-core correlation has been found 
in our sample and in others (Bridle et al. 1994; H99; Paper I). 
The results showed that models with no beaming are strongly dis- 
favoured with respect to models in which beaming is included, 
since only the latter can reproduce the observed core-jet correlation. 
The fitted 7 value for the kiloparsec-scale jets was slightly higher 
than for the corresponding fits in which the correlation was not 
taken into account - 7jot = 1.35, which corresponds to /3 ~ 0.67, 
for a = 0.5, or 7jct = 1.23, P ~ 0.58, for a = 0.8. Even so, 
these values, which represent our best estimates in the sense that 



they make use of all the available data, are still clearly much lower 
than 10. 



How can these results, implying mildly relativistic bulk speeds 
for kpc-scale jets in FRIIs, be reconciled with the much larger val- 
ues of jet bulk Lorentz factor required by beamed inverse-Compton 
models of X-ray emission from quasar jets (Section 11. U ? As we 
have already pointed out, jet velocity structure is routinely invoked 
in models of the parsec-scale jets in order to explain the wide range 
of Lorentz factors required by different observations in those cases. 
If a beamed inverse-Compton model is viable at all for kiloparsec- 
scale jets, velocity structure must be present there as well (as con- 
cluded by Hardcastle 2006). Our data are dominated by objects at 
large angles to the line of sight, and so would naturally be expected 
to give estimates of the bulk Lorentz factor that are appropriate 
to the slow-moving component of the jet. However, the presence 
of this slow-moving component in observations of jets even at a 
small angle to the line of sight certainly adds to the complications 
of detailed inverse-Compton modelling of FRII sources; the radio 
emission from the jet is the only independent information avail- 
able on the electron density present, but the assumption that all the 
radio-emitting material moves at a single bulk speed which can be 
estimated from the X-rays is no longer valid in the presence of jet 
velocity structure. Ideally measurements of jet prominences could 
be used to constrain models with jet velocity structure, but even if 
our data were good enough (and we have seen that our capability 
to distinguish between models is limited only to the most obvious 
cases) there is as yet no constraint on how the jet speed and emissiv- 
ity might vary as a function of radius, and so effectively no model to 
test. High-resolution radio observations of jets in FRIIs with next- 
generation instruments such as e-MERLIN and the EVLA are re- 
quired to give observational constraints on the surface brightness 
of FRII jets at a range of angles to the line of sight as a function 
of jet radius. It may then be possible to apply the techniques de- 
scribed here to constrain the properties of a more complex model 
of jet speeds. 
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